Uummannaq
Coarse-grain Fine-grain Coattention Network for Multi-evidence Question Answering
Zhong, Victor, Xiong, Caiming, Keskar, Nitish Shirish, Socher, Richard
End-to-end neural models have made significant progress in question answering, however recent studies show that these models implicitly assume that the answer and evidence appear close together in a single document. In this work, we propose the Coarse-grain Fine-grain Coattention Network (CFC), a new question answering model that combines information from evidence across multiple documents. The CFC consists of a coarse-grain module that interprets documents with respect to the query then finds a relevant answer, and a fine-grain module which scores each candidate answer by comparing its occurrences across all of the documents with the query. We design these modules using hierarchies of coattention and self-attention, which learn to emphasize different parts of the input. On the Qangaroo WikiHop multi-evidence question answering task, the CFC obtains a new state-of-the-art result of 70.6% on the blind test set, outperforming the previous best by 3% accuracy despite not using pretrained contextual encoders.
- Europe > Norway (0.14)
- Europe > United Kingdom > England > Cumbria (0.14)
- Europe > Denmark (0.14)
- (93 more...)
- Media > Film (1.00)
- Leisure & Entertainment (1.00)
- Government (1.00)
- (2 more...)
A Joint Optimization Model for Image Summarization Based on Image Content and Tags
Yu, Hongliang (Peking University) | Deng, Zhi-Hong (Peking University) | Yang, Yunlun (Peking University) | Xiong, Tao (The Johns Hopkins University)
As an effective technology for navigating a large number of images, image summarization is becoming a promising task with the rapid development of image sharing sites and social networks. Most existing summarization approaches use the visual-based features for image representation without considering tag information.In this paper, we propose a novel framework, named JOINT, which employs both image content and tag information to summarize images. Our model generates the summary images which can best reconstruct the original collection. Based on the assumption that an image with representative content should also have typical tags, we introduce a similarity-inducing regularizer to our model. Furthermore, we impose the lasso penalty on the objective function to yield a concise summary set. Extensive experiments demonstrate our model outperforms the state-of-the-art approaches.